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Discrimination task is treated in the case of only partial prior information from measurements 
of unknown states. The construction of the optimal discrimination device and estimation of un- 
known states is performed simultaneously. A communication through a noisy quantum channel is 
formulated in terms of the proposed discrimination protocol. 
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I. INTRODUCTION 



Non-orthogonality of generic quantum states as a pure 
quantum mechanical phenomenon poses a fundamental 
obstacle to discrimination between them. Impossibility 
of perfect distinguishing between non-orthogonal quan- 
tum states limits the efficiency of many quantum infor- 
mation protocols, for example communication through 
a noisy quantum channel. Historically, the first inves- 
tigation of quantum discrimination was based on von 
Neumann measurements jj]. It determined the error 
rate bound of discrimination between two pure states. 
The general Bayesian approach to distinguishing between 
mixed states with non-equal prior probabilities based on 
minimization of cost function and positive operator val- 
ued measurements (POVM) was developed by Holcvo 
and Helstrom ||, In 1987 Ivanovic published a dif- 
ferent model of two linear independent pure states dis- 
crimination without errors but with the occurrence of an 
inconclusive result [5J. Subsequently, the minimal prob- 
ability of the inconclusive result was shown by Dieks and 
Peres || |(| to be the overlap between the states being 
distinguished. Recently, discrimination techniques have 
been thoroughly studied, and analytical expressions for 
optimal POVMs for many quantum discrimination prob- 
lems have been derived § |, §, [tc], |l| || H 0, [jj. 
However analytical results cannot be easily obtained for 
discrimination problems which incorporate many quan- 
tum states without a particular symmetry and numerical 
methods appear efficient [ fL6[ |l7| . Some discrimination 
devices have been realized experimentally jl^, |l9|, |2(j. ^TJ . 

So far distinguishability has been studied in the case of 
maximal information on the states to be distinguished. 
In this classical approach it is assumed that the sources 
Hi prepare physical systems, for example electron spins 
or modes of light, in explicitly known quantum states 
p- t . Discrimination consists in distinguishing between 
these known states pi or equivalently sources Hi (they 
are called hypotheses) having a physical system from 
one randomly chosen source. The formal solution of 
this problem is based on the construction of the opti- 
mal POVM that minimizes a convenient cost function, 
for example the error rate or negative information ^ [| . 
The goal of this paper is to develop a discrimination tech- 
nique applicable to the case when only partial informa- 



tion on hypotheses Hi is available. Let us imagine that 
the quantum states pi prepared by the sources Hi are 
not explicitly known but we are allowed to probe each 
source before discrimination. We can ask what quan- 
tum states are produced by these sources and what is 
the optimal POVM to distinguish between them. This 
scenario realizes for example in communication through 
an unknown distorting quantum channel. The hypothe- 
ses are explicitly known by transmitter party (Alice) but 
not by receiver party (Bob). In the calibration stage Al- 
ice sends physical systems in the quantum states p\ from 
the sources Hi in given order down the channel while 
Bob receives sequence of distorted states pi . He acquires 
some information (prior data) on hypothesis Hi through 
a generic measurement of states pi. In communication 
stage, which follows, Alice sends a message with the help 
of her alphabet Hi. Transmission is successful if Bob is 
able to discriminate between hypothesis Hi or equiva- 
lently between quantum states pi with the help of prior 
data. 

The article is organized as follows. The standard dis- 
crimination is summarized in section O. Novel method 
of prior data manipulation for simultaneous estimation 
of the unknown states and design of the optimal discrim- 
ination POVM is proposed in section III. Two examples 
are shown in section IV. 



II. STANDARD APPROACH 

Let us consider the following discrimination protocol. 
Alice chooses in secret a quantum state from the set of 
two known quantum states |1), |2) and sends it to Bob 
who must decide which state was chosen using a POVM 
measurement. For simplicity, but without the loss of gen- 
erality, we will consider the same probability for choosing 
|1) and |2). A natural measure of Bob's failure is the er- 
ror rate ER = (N±2 + ^2i)/2iV, where N is the number 
of trials, N12 is the number of Bob's wrong decisions |1) 
when |2) was true and N21 is the number of his wrong de- 
cisions |2) when |1) was true. Nonorthogonality of states 
implies the ultimate lower limit of the error rate — the 
Helstrom bound |3j, 
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The optimal POVM measurement IIi, II2 for the dis- 
crimination between two mixed states pi, p 2 is the one 
that minimizes the error rate 



ER= ~( Tv[ll ,,),,] + Tr[U>i>{\ 



(2) 



subject to the constraint IIi + II2 = 1, or equivalently 
maximizes the functional 

IE = Tr [IIipi] + Tr [U 2 p 2 } - Tr [A (Hi + n 2 )] , (3) 

where IIi^ are now treated as independent variables. 
Variation of (|§|) subject to the positivity constraint on 
the operators IIi 2 gives the extremal equations for the 
optimal POVM elements and for the Lagrange operator 
multiplier A, 



/ oiiii = Aiii, p 2 n 2 = An 2 , 

P1LT1 + p 2 n 2 = A. 



(4) 



For example, solving extremal equations (Q) for two 
known pure states of qubit \ipi } 2) — cos a |+z)±sina \ —z) 
or corresponding mixed states 



Pi, 2 



cos 2 a 
±d cos a sin a 



id cos a sin a 1 

sin 2 a )' (5) 



parametrized by a s [0, 7r/4] and d G [0,1], yields the 
optimal POVM and ultimate error rate, 



1/1 ±1 
2 ^±1 1 

ER = -(l-dsin2a). 
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III. 



DISCRIMINATION FROM PARTIAL 
INFORMATION 



In the previous section we assumed that Bob knows the 
states pi, 2 explicitly and thus he can use this maximal 
prior information for designing the optimal discrimina- 
tion POVM. However in real communication through an 
unknown noisy channel Bob does not know these states 
perfectly. Therefore a calibration stage needs to be per- 
formed first. In this stage Alice transmits a given number 
of physical systems in state p[ from the known source 
Hi through communication channel and Bob applies a 
prior POVM measurement itk to the unknown received 
state pi. He obtains real noisy data fik that approxi- 
mate true probabilities p±k = Tr[/?i7Tfc]. The same cal- 
ibration is done for source H 2 . The prior information 
obtained by Bob during the calibration stage is equiva- 
lent to the explicit knowledge of the states pi j2 only in the 
non-physical case of tomographically complete POVM Hk 
and infinitely large number of transmitted states. The 
most simple and straightforward solution of this mod- 
ified discrimination problem is obvious. Bob can esti- 
mate the states pi, p 2 from the prior data fik, f 2 k and 
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FIG. 1: Scheme of the experimental setup used for prior mea- 
surement of quantum states in calibration stage and design of 
the optimal discrimination POVM. 



then solve the equations @) in order to obtain the opti- 
mal POVM ni 2 . Unfortunately, succession of these two 
steps does not represent the optimal strategy. If the data 
are not sufficient for accurate reconstruction of the states 
the deviations can be propagated to discrimination step. 
Another important problem arises from solution of the 
extremal equations derived from non-symmetrical mixed 
states obtained as a result of some numerical state recon- 
struction technique [H c3> HI - 

Now we will formulate the modified discrimination pro- 
tocol in the general case of I sources H i7 i = 1, . . . , /, and 
will shown information-based solution of this problem. 
Let us start with / unknown mixed states pi produced 
by the sources Hi and propagated through a noisy chan- 
nel. We observe these states in the calibration stage by 
prior POVM 7Tfc, k = 1, . . . , K , and obtain prior data fik, 
Sfc fik = 1j that approximate the true probabilities 



Pik = Tr [piir k ] 



(7) 



In the discrimination stage the states pi are examined 
in virtual sense by designed discrimination POVM Hj, 
j = 1, . . . , J. If we actually performed this measurement 
we would obtain frequencies Fij, ^V F^j = 1, sampling 
probabilities 



Tr [ Pl \l 



(8) 



For simplicity, let us consider the same number of states 
Pi and designed POVM elements Hj, I = J. In the case of 
more POVM elements than discriminated states the pro- 
posed method would generalize the Ivanovic-Dieks-Peres 
[Q, ^, ^| unambiguous discrimination or combination of 
both methods, not the pure Helstrom scheme. For opti- 
mal POVM the sum of diagonal frequencies J2i Fa should 
be maximal and the error rate F^ — y~] { Fa should be 
minimal. Thus the desired discrimination POVM should 
yield data F^ as close to Kronecker's delta Sij as possible. 
This is schematically shown in Fig. [TJ. 
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In the case of the finite number of states used for cal- 
ibration the equalities fik = Pik do not hold exactly. 



Similarly the equalities F{. 



P,, 



Sij are not sat- 



isfied because the perfect discrimination between non- 
orthogonal states is not possible. These equalities are 
satisfied only approximately and thus cannot be directly 
used for the estimation of the states pi and design of the 
optimal POVM ILy. Therefore equality between data fa-, 
Fij and probabilities pik , Pij must be handled in weaker 
- statistical - manner. We will construct a functional 
IF [/ifc , F^, Pik{Pi), Pij(pi,Ilj)] that will serve as a mea- 
sure of the distance between the data and the probabil- 
ities. It should be maximal for states pi which produce 
prior probabilities pik close to prior data as possible 
and simultaneously it should be maximal for and POVM 
IIj which produces probabilities P^ as close to the de- 
sired data F^ — 5ij as possible. There are many convex 
and additive measures of distance between probability 
distributions. We will choose the one that provides the 
most simple extremal equations — the log likelihood func- 
tional, 

iogiL[pi,rif] = ^2 5 v ln Aj + ^2 f ik ln P* fc - ( 9 ) 

ij ik 

Log likelihood as a measure of distance between quan- 
tum mechanical objects and real data arises from non- 
normalized multinomial distribution 



ijk 



>,j Pik ~ ]^[ Pa Pik 



and has unique properties from the physical 
as mathematical [^8] point of view. 

Variation of the log likelihood functional @, 

loglL^ + X u ILj + Yj] - loglLfo, Uj] = 0, 



(10) 
as well 

(11) 



for all Xj,, Yj, that yields extremal equations for the states 
Pi and the optimal POVM IP,- must preserve trace nor- 
malization of the states, Tr[p^] — 1, and completeness of 
the POVM, JVILj = 1. This necessary constraints can 
be incorporated with the help of undetermined Lagrange 
multipliers. The functional to be maximized becomes 



W[pi, Uj] = ^ S H ^ p ij + X! $ ik lnpik ~ 

ij ik 

^/XiTr^-Tr^^n,], (12) 

i 3 

where pi and II j are the independent variables. The esti- 
mated unknown states pf st and the designed optimal dis- 
crimination POVM II° pt which maximize the functional 
( |l2| ) are solutions of the extremal equations 

H^RiPiRi = Pi, X^SjUjSjX- 1 = n 

where 
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(13) 
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■Pi, (14) 
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= (Tr [RiPiRi])* , A = ( SjUjS. 



(15) 



Notice that A is positive definite operator and pi are pos- 
itive numbers. The set of nonlinear operator equations 
(O-na) can be solved by means of repeated iterations, 



(A(«))-i^" ) n^ ) ^" ) (A("))- 1 = 



(16) 
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The convexity of the log likelihood functional guaran- 
tees that these equations have only one global maximum 
or a plateau of global maxima. As initial iterations we 
can choose maximally mixed states and POVM elements 
ITj -' = -jl. Let us note that when the true states pi 
are mixed the estimated states pf* are biased towards 
pure states due to the simultaneous construction of the 
discrimination POVM Uj. The reason is that, roughly 
speaking, the rays are what is important for discrimina- 
tion, and pure states can be discriminated easier than 
mixed ones. In another words, the proposed method 
finds the discrimination POVM IL° P optimal in the sense 
of maximum likelihood, and simultaneously reconstructs 
rays corresponding to the unknown states. 



IV. EXAMPLES 

Here we illustrate our procedure on discrimination be- 
tween two qubit states pi,2, such as two spin states of 
an electron or two polarization states of a photon. We 
perform numerical simulations of the prior measurements 
and we find subsequently the states pf\ and determined 
the optimal POVM n° P 2 via iterative solution of the ex- 
tremal equations (p^jlq). In our simulations we assume 
the class of states (g) and study the dependence of the 
error rate (|J) on angle a that controls the overlap be- 
tween the two discriminated states pi,2- The error rate 
(||) computed from the true states p\^ and the optimal 
POVM U° P 2 is used for evaluating the quality of the de- 
signed POVM Hi P 2- Thus the optimality of the discrim- 
ination POVM n° P 2 obtained with the help of the pro- 
posed method is quantified by the standard widely used 
measure of discrimination success. 

In the first example we consider two symmetrical 
mixed states (||) with d = 0.9 on the Bob's side of 
the communication channel. The prior POVM con- 
sists of projective measurements in x and y directions, 
each made on V = 1000 physical systems in states 
pi t 2- This represents the calibration stage. Observa- 
tion {tt x , tt- x , TT y , n-y} is tomographically incomplete 
and thus not sufficient for estimation of mixed states. 
Nevertheless, this measurement yields some information 
on the states pi 2 and, discrimination with the error 
rate less than 50% is possible. The error rate (j^), 
i(Tr[LI^ pt p2] + TrpT^'/Oi])? of the optimal discrimination 
POVM Ii° pt is shown in Fig. 0. 
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FIG. 2: Error rate of the optimal discrimination between two 
symmetrical mixed states using partial prior information from 
the tomographically incomplete measurement (dashed), see 
text. Prior data / fluctuate around their theoretical values 
p with standard deviation y/p(l — p)/V~N, N = 1000. Error 
rate is limited by Helstrom bound (solid). 

In the second example we discriminate between the 
pure state pi = | , i/ , i)(V , i|, d = 1, and the mixed state pi 
d) with d = 3/4, having prior data from the tomograph- 
ically complete observation {tt x , tt- x , ir y , 7r_ a ,7r z , 7r_ z }. 
Each of these projective measurements in x, y and z di- 
rections are done with only N = 10 physical systems in 
states pi and p2- The optimal POVM n° P 2 can be found 
with the help of ([l6|). The corresponding error rate is 
shown in Fig. ^. In the case of an infinitely large number 
of physical systems used for prior measurement in the 
calibration stage the prior data fik attain the theoretical 
probabilities pik and the error rate approaches the bound 
(g), ER = |(l-i±^sin2a) = ±(l-f sin2a), as is shown 
in Fig. H by solid line. 

V. CONCLUSION 

Simultaneous reconstruction of quantum states and 
design of optimal quantum discrimination POVM have 
been presented. Observation on finite samples of phys- 
ical systems in unknown quantum states in the calibra- 
tion stage provides the only prior information used for 
discrimination between these quantum states. The sys- 
tem of the extremal equations ([l3] |l5|) fully determines 
the unknown states and the optimal POVM for their 
discrimination. These nonlinear operator equations can 
be solved by means of repeated iterations. This has 
been demonstrated on particular examples and verified 



in many other cases. Typically several hundreds or thou- 
sands of iterations are needed for 1CP 16 accuracy. In the 
case of finite samples of physical systems used for the 
calibration the Helstrom bound represents a lower limit 
of the achievable error rate. For tomographically com- 
plete prior observation the error rate can come very close 
to this bound even if the prior measurements are car- 
ried out on a small number of states. This means that 
in practice Alice and Bob need only a few transmitted 
quantum states for establishing nearly optimal discrim- 
ination device. The calibration stage thus can be very 
short and almost all the states reserved for communica- 
tion can be used up for transmission of useful data from 
Alice to Bob. Of course the ratio of the transmitted 
calibration states to the states used for communication 
can be adjusted in order to achieve either transmission 
with maximal speed for the given error rate or transmis- 
sion with minimal error rate for the given speed. These 
and others optimization problems can be formulated as 
a modifications of the presented quantum discrimination 
protocol. 
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FIG. 3: The error rate of the optimal discrimination be- 
tween the pure and the mixed state using the partial prior 
information from the tomographically complete measurement 
(dashed), see text. Prior data / fluctuate around their the- 
oretical values p with standard deviation y/p(l — p)/v~N, 
N = 10. Error rate is limited by Helstrom bound (solid). 
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